Improved Parameterized Algorithms for Constraint Satisfaction 



Eun Jung Kim* 

LAMSADE-CNRS, Universite de Paris-Dauphine, 75775 Paris, France 

Ryan Williams^ 
Stanford University, Stanford, CA, USA 



Abstract 

For many constraint satisfaction problems, the algorithm that chooses a random assignment achieves
the best possible approximation ratio. For instance, a simple random assignment for Max-E3-Sat gives a
7/8-approximation, and for every $\varepsilon > 0$ there is no polynomial-time $(7/8 + \varepsilon)$-approximation unless
P=NP. Another example is the Permutation CSP of bounded arity. Given the expected fraction $p$ of
the constraints satisfied by a random assignment (i.e. permutation), there is no $(p + \varepsilon)$-approximation
algorithm for every $\varepsilon > 0$, assuming the Unique Games Conjecture (UGC).

In this work, we consider the following parameterization of constraint satisfaction problems. Given a
set of m constraints of constant arity, can we satisfy at least pm + k constraints, where p is the expected
fraction of constraints satisfied by a random assignment? Constraint Satisfaction Problems above
Average have been posed in different forms in the literature [181117]. We present a faster parameterized
algorithm for deciding whether m/2 + k/2 equations can be simultaneously satisfied over $\mathbb{F}_2$. As a
consequence, we obtain O(k)-variable bikernels for Boolean CSPs of arity c for every fixed c, and for
permutation CSPs of arity 3. This implies linear bikernels for many problems under the "above average"
parameterization, such as Max-c-Sat, Set-Splitting, Betweenness and Max Acyclic Subgraph.
As a result, all the parameterized problems we consider in this paper admit $2^{O(k)}$-time algorithms.

We also obtain non-trivial hybrid algorithms for every Max c-CSP: for every instance I, we can either
approximate I beyond the random assignment threshold in polynomial time, or we can find an optimal
solution to I in subexponential time.
1 Introduction



The constraint satisfaction problem (CSP) is a general language to express many combinatorial problems 
such as graph coloring, satisfiability and various permutation problems. An instance of a CSP is a set V of 
variables, a domain D for the variables and C a set of constraints. The objective is to assign a value from D 
to each variable of V so as to maximize the number of satisfied constraints. For example, 3-Coloring can 
be seen as a CSP over a three-element domain, and the constraints correspond to edges (thus the arity of 
each constraint is 2), indicating that values assigned to the endpoints of an edge must differ. In this work, 
we are interested in two types of CSPs. In a boolean CSP, the domain D is $\{-1, +1\}$. In a permutation
CSP, the size of the domain equals |V| and we require that the assignment is a bijection.

As solving CSPs is NP-hard in general, the next question is whether they allow efficient approximation 
algorithms. Interestingly, many constraint satisfaction problems exhibit a hardness threshold, where it is 
relatively easy to obtain a feasible solution that satisfies a certain fraction of the optimum number of 
constraints, yet it is difficult to find a solution that is even slightly better. Perhaps the best known example 
is Max-E3-Sat, where we are given a CNF formula in which all clauses have exactly three literals, and wish 
to find a truth assignment satisfying as many clauses as possible. Although a uniform random assignment 
satisfies 7/8 of the clauses, Håstad [14] proved that it is NP-hard to satisfy a $7/8 + \varepsilon$ fraction for every $\varepsilon > 0$. The
list of problems exhibiting a hardness threshold contains Max-Ec-Sat for $c \ge 3$, Max-Ec-Lin-2 for $c \ge 3$,
and Ec-Set Splitting for $c \ge 4$. For all these problems, a uniform random assignment achieves the best
approximation ratio, assuming P $\neq$ NP. Furthermore, for these problems the lower bounds on the optimum
are also tight in the sense that they are optimal for an infinite sequence of instances. 

For permutation CSPs, similar results have been identified, conditioned on the Unique Games Conjecture
(UGC) of Khot [TS]. In the Betweenness problem, we have a set of betweenness constraints of the form
"$v_i$ is between $v_j$ and $v_k$" for distinct variables $v_i, v_j, v_k \in V$, and the task is to find a permutation of the
variables that satisfies the maximum number of constraints. More formally, constraints have the form

$(\pi(v_j) < \pi(v_i) < \pi(v_k)) \vee (\pi(v_k) < \pi(v_i) < \pi(v_j))$

and the task is to find a bijection $\pi : V \to \{1, \ldots, |V|\}$ that satisfies the maximum number of constraints. We can satisfy one-third
of the constraints in expectation by choosing a uniform random permutation. Moreover, it is hard to achieve a
better approximation ratio, assuming UGC [5]. Hardness thresholds under UGC were known for permutation
CSPs of arity 2 and 3 [HI [5]. Recently these results were generalized to arbitrary fixed arity [5]. Here, the
lower bound on the optimum obtained by a random assignment is tight as well: consider, for example, an
instance of Betweenness that contains all three possible constraints for every triple of variables.
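
Both facts about Betweenness can be checked by brute force on small instances. The Python sketch below is illustrative only: the function names and the encoding of a constraint as a triple (i, j, k), read as "variable i is between j and k", are assumptions made here and are not notation from the paper.

```python
from fractions import Fraction
from itertools import combinations, permutations

def satisfied(pos, constraint):
    """pos[v] is the position of variable v; (i, j, k) means i lies between j and k."""
    i, j, k = constraint
    return pos[j] < pos[i] < pos[k] or pos[k] < pos[i] < pos[j]

def count_satisfied(pos, constraints):
    return sum(satisfied(pos, c) for c in constraints)

def average_fraction(n, constraints):
    """Exact expected fraction of satisfied constraints under a uniform random ordering."""
    orderings = list(permutations(range(n)))
    total = sum(count_satisfied(pos, constraints) for pos in orderings)
    return Fraction(total, len(orderings) * len(constraints))

def best_fraction(n, constraints):
    """Largest fraction of constraints satisfied by any single ordering."""
    best = max(count_satisfied(pos, constraints) for pos in permutations(range(n)))
    return Fraction(best, len(constraints))

def all_betweenness_constraints(n):
    """For every triple of variables, the three constraints naming each one as the middle."""
    out = []
    for a, b, c in combinations(range(n), 3):
        out += [(a, b, c), (b, a, c), (c, a, b)]
    return out
```

On the instance built by `all_betweenness_constraints`, every ordering places exactly one variable of each triple in the middle, so the best ordering does no better than the average.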

These threshold phenomena are fascinating in that they provide a sharp boundary between feasibility 
and infeasibility: while the average is easy to obtain, satisfying an "above average" fraction is intractable. 
While the identification of such thresholds is extremely interesting, it does not end the story but rather 
initiates another one. As it is likely that practitioners will require feasible solutions that exceed these easy 
thresholds, it is important to understand how much computational effort is required to solve a problem 
beyond its threshold. One way of cleanly formalizing this question uses parameterized complexity. 

A parameterized problem is a subset $L \subseteq \Sigma^* \times \mathbb{N}$ over a finite alphabet $\Sigma$. $L$ is fixed-parameter tractable
if the membership of an instance $(x, k) \in \Sigma^* \times \mathbb{N}$ in $L$ can be decided in time $|x|^{O(1)} \cdot f(k)$, where $f$ is a computable function
of the parameter [7]. Given a pair of parameterized problems $L$ and $L'$, a bikernelization is a polynomial-time
pre-processing algorithm that maps an instance $(x, k)$ to an instance $(x', k')$ (the bikernel) such that
(i) $(x, k) \in L$ if and only if $(x', k') \in L'$, (ii) $k' \le f(k)$, and (iii) $|x'| \le g(k)$ for some functions $f$ and $g$.
The function $g(k)$ is called the size of the bikernel. A parameterized problem is fixed-parameter tractable if
and only if it is decidable and admits a bikernelization [7]. A kernelization of a parameterized problem is a
bikernelization to itself. For an overview of kernelization, see the recent survey [3].

Motivated by the discussion above, our work focuses on the following question: 

(permutation) Max-c-CSP Above Average: We are given a parameter k and a set of constraints
with at most c variables per constraint. Each constraint has a positive integer weight.
Determine if there is a variable assignment (or permutation) that satisfies a subset of constraints
with total weight at least $p \cdot W + k$, where W is the total weight of all constraints and p is the
expected fraction of weighted constraints satisfied by a uniform random assignment.

Previous Work. Parameterizations above a guaranteed value were first considered by Mahajan and Raman
[TO] for the problems Max-Sat and Max-Cut. In a recent paper [17], Mahajan, Raman and Sikdar argue,
in detail, that a practical (and challenging) parameter for a maximization problem is the number of clauses
satisfied above a tight lower bound, which is $(1 - 2^{-c})m$ for Max-Sat if each clause contains exactly c
different variables. In the monograph by Niedermeier [18], an open problem attributed to Benny Chor [25,
p.43] asks whether Betweenness Above Average is fixed-parameter tractable.

A systematic way to investigate above-average parameterizations was recently presented by Gutin
et al. [12]. They presented reductions to quadratic bikernels (i.e., bikernels with $O(k^2)$ variables) for the
above-average versions of problems such as Maximum Acyclic Subgraph and Max-c-Lin-2. Alon et
al. [1] pushed forward the idea of representing a CSP instance algebraically, presenting a quadratic kernel
for Max-c-CSP using a similar method. This method was used to give a fixed-parameter algorithm for
Betweenness Above Average [11], in which the idea of a coarse ordering is used. This result was later
generalized in [13] to obtain a quadratic bikernel for Permutation Max-3-CSP.

Our contribution. We show that every Max-c-CSP and Permutation Max-3-CSP admits a problem
bikernel with only O(k) variables in the above-average parameterization. More precisely, we prove that
essentially any hard instance of Max-c-CSP must have fewer than $c(c+1)k/2$ variables; instances with more
variables are yes-instances for which good assignments can be generated in polynomial time. This improves
over the main results of Alon et al. [1], who gave kernels of $O(k^2)$ variables, Crowston et al. [6] for
Max-c-CSP with $O(k \log k)$ variables, and [13] for Permutation Max-3-CSP of $O(k^2)$ variables. This implies
linear variable bikernels for the above-average versions of many different problems, such as Max-c-Lin-2,
Max-c-Sat, Set Splitting when the sets are of size $\ge 4$, Maximum Acyclic Subgraph, Betweenness,
Circular Ordering, and 3-Linear Ordering.

The key to our results is a fixed-parameter algorithm for the following problem: 

Max-c-Lin-2 Above Average: We are given a parameter k and a system of linear equations 
over F2 with at most c variables per equation. Each equation e has a positive integer weight, and 
the weight of an assignment in the system is defined to be the total sum of weights of equations 
satisfied by the assignment. Determine if there is an assignment of weight at least W/2 + k/2, 
where W is the total weight of all equations. 

Note the random assignment algorithm yields W/2 weight, and it is famously NP-hard to attain $W/2 + \varepsilon W$
for every $\varepsilon > 0$ [14]. Our proofs imply the stronger result that every constraint satisfaction problem admits
a hybrid algorithm, in the following sense: 

Theorem 1. For every Boolean Max-c-CSP, there is an algorithm with the property that, for every e > 0, 
on any instance I, the algorithm outputs either: 

• an optimal solution to I within $O^*(2^{c(c+1)\varepsilon m/2})$ time, or

• a $(p + \varepsilon/2^{c})$-approximation to I within polynomial time, where p is the expected fraction of weighted
constraints satisfied by a uniform random assignment to the CSP. 

This resolves an open problem of Vassilevska, Williams, and Woo [20], who asked if Max-3-Sat had an 
algorithm of this form. 
2 Preliminaries



We define the Boolean Max-c-CSP and Max-c-Permutation CSP. A boolean constraint satisfaction
problem is specified by the domain $\{-1, +1\}$ and a set of predicates $\mathcal{P}$, also called payoff functions. A
predicate $P \in \mathcal{P}$ is a function from $\{-1, +1\}^{c'}$ to $\{0, 1\}$ for $c' \le c$. The maximum number c of inputs to the
predicates in $\mathcal{P}$ is the arity of the problem. We interpret $-1$ as the value True and $+1$ as False.

An instance of boolean CSP is specified as a set of variables V along with a collection of triples
$C = \{(f_1, S_1, w_1), \ldots, (f_m, S_m, w_m)\}$, where $f_i \in \mathcal{P}$, every $S_i$ is an ordered tuple from V of size at most c, and every
$w_i$ is a positive integer. A variable assignment $\phi : V \to D$ satisfies a constraint $(f_i, (s_1, \ldots, s_{c'}), w_i)$ provided
that $f_i(\phi(s_1), \ldots, \phi(s_{c'})) = 1$. Our goal is to find an assignment $\phi : V \to \{-1, +1\}$ of maximum weight.
Here the weight of an assignment $\phi$ is defined to be the total sum of weights $w_i$ of constraints $(f_i, S_i, w_i)$
satisfied by $\phi$. For example, Max-E3-Sat is specified by a single predicate $P : \{-1, +1\}^3 \to \{0, 1\}$, where
$P(x, y, z) = 0$ if and only if $x = y = z = +1$.

We consider the following parameterization of boolean CSP. 

Max-c-CSP Above Average (c-CSP$_{AA}$)

Input: A set V of variables, a collection of constraints $C = \{(f_1, S_1, w_1), \ldots, (f_m, S_m, w_m)\}$ with
$|S_i| \le c$ for every i, an integer k > 0.
Parameter: k

Goal: Determine if there is an assignment with weight at least $p \cdot W + k/2^{c}$, where W is the
total weight of all constraints and p is the expected fraction of weighted constraints satisfied by
a uniform random assignment.

In the permutation CSP problem, the domain D is $[n] = \{1, \ldots, n\}$ and a predicate $P \in \mathcal{P}$ is a function from
$\mathbb{S}_{c'}$ to $\{0, 1\}$ for $c' \le c$, where $\mathbb{S}_{c'}$ is the set of permutations on $\{1, \ldots, c'\}$. Let $\mathcal{P}$ be a set of predicates. An
instance of permutation CSP is given as a variable set V and a collection $C = \{(f_1, S_1, w_1), \ldots, (f_m, S_m, w_m)\}$,
where every $S_i$ is an ordered tuple from V of size at most c, every $w_i$ is a positive integer, and $f_i \in \mathcal{P}$ is applied
to the tuple $S_i$. In a permutation CSP, a variable assignment $\phi : V \to D$ is required to be a bijection,
or equivalently, a permutation. A permutation $\phi$ satisfies a constraint $(f_i, (s_1, \ldots, s_{c'}), w_i)$ provided that
$f_i(\phi(S_i)) = 1$, viewing the local permutation $\phi(S_i)$ as an element of $\mathbb{S}_{c'}$. Our goal is to find an assignment
$\phi : V \to [n]$ of maximum weight, where the weight of $\phi$ is the total sum of weights of constraints satisfied
by $\phi$. For example, Betweenness is specified by a single predicate of arity 3, i.e. $P : \mathbb{S}_3 \to \{0, 1\}$, where
$P(x, y, z) = 1$ if and only if $xyz \in \{123, 321\}$.

We consider the following parameterization of permutation CSP: 

Max-c-Permutation CSP Above Average 

Input: A set V of variables, a collection of constraints $C = \{(f_1, S_1, w_1), \ldots, (f_m, S_m, w_m)\}$, an
integer k > 0.
Parameter: k

Goal: Determine if there is an assignment with weight at least $p \cdot W + k/(c!\,4^{c})$, where W is the
total weight of all constraints and p is the expected fraction of weighted constraints satisfied by 
a uniform random permutation. 

In what follows, we omit Above Average and simply say Max-c-Lin-2, Max-c-CSP and Max-c- 
Permutation CSP to refer to the parameterized problems. 

For Max-c-Permutation CSP, let $\mathcal{P}$ be the associated set of predicates. For each predicate $P \in \mathcal{P}$, we
can identify the set $\Pi_P = \{\pi \in \mathbb{S}_c : P(\pi) = 1\}$. Notice that $\phi$ satisfies $(f_i, S_i = (v_1, \ldots, v_{c'}), w_i)$ if and only
if there is a $\pi \in \Pi_{f_i}$ s.t. $\phi(v_1)\phi(v_2) \cdots \phi(v_{c'}) = \pi(1)\pi(2) \cdots \pi(c')$, and thus if and only if there is $\pi \in \Pi_{f_i}$ s.t.

$\phi(v_{\pi^{-1}(1)}) < \phi(v_{\pi^{-1}(2)}) < \cdots < \phi(v_{\pi^{-1}(c')})$.

The folklore result below allows us to focus on the case when $\mathcal{P}$
contains a single predicate P such that $\Pi_P = \{12 \cdots c\}$, which we denote c-Linear Ordering. The proof
for the case c = 3 and $|\mathcal{P}| = 1$ can be found in [13] and its extension for arbitrary fixed c is straightforward.
As the proof of [13] considers only the case when $\mathcal{P}$ has a single predicate, we sketch the proof here even
though its generalization is immediate.

Proposition 1. Let $(V, C, k)$ be an instance of c-OCSP$_{AA}$. There is a polynomial time transformation
R from Max-c-Permutation CSP to c-Linear Ordering such that an instance $(V, C, k)$ of
Max-c-Permutation CSP is a yes-instance if and only if $R(V, C, k)$ is a yes-instance of c-Linear Ordering.

Proof. From an instance $(V, C, k)$ of Max-c-Permutation CSP, we construct an instance $(V, C_0, k)$ of
c-Linear Ordering as follows. We shall express a constraint $(f_i, S_i, w_i)$ by a set of constraints $R(f_i, S_i, w_i)$
such that $(f_i, S_i, w_i)$ is satisfied if and only if exactly one of $R(f_i, S_i, w_i)$ is satisfied. Let $S_i$ be $(v_1, v_2, \ldots, v_{c'})$.

For every element $\pi$ of $\Pi_{f_i} = \{\pi \in \mathbb{S}_{c'} : f_i(\pi) = 1\}$, we add to the instance of c-Linear Ordering the
constraint $(P_{id}, (v_{\pi^{-1}(1)}, v_{\pi^{-1}(2)}, \ldots, v_{\pi^{-1}(c')}), w_i)$. Here $P_{id}$ is a function mapping the identity permutation to 1
and other permutations to 0. Notice that $(f_i, S_i, w_i)$ is satisfied if and only if exactly one of the constraints
$(P_{id}, (v_{\pi^{-1}(1)}, v_{\pi^{-1}(2)}, \ldots, v_{\pi^{-1}(c')}), w_i)$ for $\pi \in \Pi_{f_i}$ is satisfied. Hence the weight of a linear ordering $\phi$
remains the same in the original and transformed instances. Moreover, the expected satisfied fraction of the
constraint $(f_i, S_i, w_i)$ is $|\Pi_{f_i}|/(c')!$ and the expected satisfied fraction of the new constraints is the same.
Hence, the instance of Max-c-Permutation CSP has a linear ordering of weight $p \cdot W + k$ if and only if
the constructed instance of c-Linear Ordering has a linear ordering of weight $p \cdot W + k$. □
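
The transformation in this proof can be sketched in a few lines of Python. This is a minimal illustration under assumptions made here: a permutation is a tuple in one-line notation, an ordering is a dictionary from variables to positions, and the helper names (`local_pattern`, `to_linear_ordering`, `increasing`) are hypothetical.

```python
def local_pattern(phi, S):
    """Relative order of phi on the tuple S, as a permutation of 1..len(S)."""
    ranks = sorted(phi[v] for v in S)
    return tuple(ranks.index(phi[v]) + 1 for v in S)

def to_linear_ordering(Pi, S, w):
    """One linear-ordering constraint per pi in Pi: the tuple (v_{pi^-1(1)}, ..., v_{pi^-1(c')})."""
    out = []
    for pi in Pi:
        # inverse[r] is the index t with pi[t] == r + 1
        inverse = sorted(range(len(S)), key=lambda t: pi[t])
        out.append((tuple(S[t] for t in inverse), w))
    return out

def increasing(phi, T):
    """True when phi orders the variables of T from left to right."""
    return all(phi[a] < phi[b] for a, b in zip(T, T[1:]))
```

An ordering induces exactly one local pattern on S, so at most one of the new constraints can hold, and one holds precisely when that pattern belongs to Pi.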

It is well known that every function $f : \{1, -1\}^n \to \mathbb{R}$ can be uniquely expressed as a multilinear
polynomial

$f(x) = \sum_{S \subseteq [n]} \hat{f}(S)\, \chi_S(x),$

where $\hat{f}(S)$ is the Fourier coefficient of $f$ on $S$, defined as

$\hat{f}(S) := \mathbb{E}_{x \in \{1, -1\}^n} [f(x)\, \chi_S(x)]$

and the character function $\chi_S(x)$ is defined as $\chi_S(x) := \prod_{i \in S} x_i$. Given the truth table of $f$, the Fourier
coefficients of $f$ can be computed via the inverse Fourier transform (one reference is [19]).
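
As a small worked example of this expansion, the following Python sketch computes the Fourier coefficients of a predicate directly from its truth table by averaging, rather than by a fast transform. The function names and the choice of the Max-E3-Sat predicate as input are illustrative assumptions.

```python
from fractions import Fraction
from itertools import combinations, product
from math import prod

def fourier_coefficients(f, n):
    """Map each subset S (as a sorted tuple of indices) to E_x[f(x) * chi_S(x)]."""
    points = list(product((1, -1), repeat=n))
    coeffs = {}
    for size in range(n + 1):
        for S in combinations(range(n), size):
            total = sum(f(x) * prod(x[i] for i in S) for x in points)
            coeffs[S] = Fraction(total, len(points))
    return coeffs

def e3sat_clause(x):
    """-1 encodes True and +1 encodes False; the clause fails only when all inputs are +1."""
    return 0 if x == (1, 1, 1) else 1

def evaluate(coeffs, x):
    """Evaluate the multilinear polynomial given by the coefficients at the point x."""
    return sum(c * prod(x[i] for i in S) for S, c in coeffs.items())
```

For this predicate the coefficient on the empty set is 7/8, the fraction satisfied by a uniform random assignment, and every other coefficient is -1/8.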

3 Max-c-Lin-2 Above Average 

We now turn to describing improved parameterized algorithms for maximum constraint satisfaction problems 
with a constant number of variables per constraint, including the problems of satisfying a maximum subset 
of linear equations and maximum CNF satisfiability. At the heart of our approach is a faster algorithm for 
Max-c-Lin-2 Above Average that can be applied in a general way to solve other CSPs. 

Theorem 2. For every $c \ge 2$, Max-c-Lin-2 Above Average can be solved in $O(2^{(c(c+1)/2)k} \cdot m)$ time.

In [20], the authors gave a "hybrid algorithm" for the unweighted problem Max-E3-Lin-2 (where exactly
three variables appear in each equation), with the property that, after a polynomial time test of the instance,
the algorithm either outputs an assignment satisfying $(1/2 + \varepsilon)m$ equations in polynomial time, or outputs
an optimal assignment in $2^{O(\varepsilon m)}$ time. The algorithm works by finding a maximal subset of
equations such that every pair of equations share no variables; based on the size of this set, the hybrid
algorithm decides to either approximately solve the instance or solve it exactly. Our algorithm is in a similar
spirit, but requires several modifications to yield a parameterized algorithm for the weighted case, to deal
with any $c \ge 2$, and to deal with "mixed" equations that can have different numbers of variables.

Let F be a set of equations over $\mathbb{F}_2$, where each equation e contains at most c variables and has a positive
integral weight w(e). For a single equation $e \in F$, let var(e) be the set of all variables appearing in e. Let
$var(F) = \bigcup_{e \in F} var(e)$. For a set of equations $F'$, the weight $w(F')$ is the sum of weights w(e) over $e \in F'$.
The weight of an assignment is the total weight of equations that are satisfied by the assignment.

Note that Max-2-Lin-2 Above Average is a generalization of Max Cut Above Average on weighted
graphs: by simulating each edge $\{u, v\}$ of weight w with an equation $x_u + x_v = 1$ of weight w, the
Max-2-Lin-2 problem easily captures Max Cut.
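
The edge-to-equation simulation is easy to state in code. In this Python sketch, the representation of an equation as a triple (set of variables, right-hand side bit, weight) and the function names are assumptions chosen for illustration.

```python
def maxcut_to_lin2(edges):
    """Turn each weighted edge (u, v, w) into the equation x_u + x_v = 1 of weight w."""
    return [(frozenset((u, v)), 1, w) for u, v, w in edges]

def cut_weight(edges, side):
    """Weight of the edges whose endpoints lie on different sides (side[v] is 0 or 1)."""
    return sum(w for u, v, w in edges if side[u] != side[v])

def lin2_weight(equations, assignment):
    """Total weight of the equations over F_2 satisfied by the assignment."""
    return sum(w for variables, rhs, w in equations
               if sum(assignment[v] for v in variables) % 2 == rhs)
```

An edge is cut exactly when its equation is satisfied, so a cut and the corresponding assignment always have the same weight.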






We assume that the given instance is reduced in the sense that there is no pair of equations e, e' with
e = e' + 1 (mod 2). (Such an equation e is said to be degenerate in [20].) If such a pair exists, one can
remove the equation of lesser weight (call it e') and subtract w(e') from w(e). Every assignment satisfies
exactly one of e and e', so this lowers the weight of every variable assignment by exactly w(e'); since W drops
by 2w(e'), the excess of every assignment over W/2 is unchanged.
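
A minimal Python sketch of this reduction rule follows. It assumes an equation is a triple (frozenset of variables, right-hand side bit, weight). Two choices here go slightly beyond the text and are assumptions: equations with the same variables and the same right-hand side are merged by adding their weights, and a degenerate pair of equal weight is dropped entirely.

```python
def reduce_instance(equations):
    """Cancel degenerate pairs (same variables, opposite right-hand sides)."""
    by_variables = {}
    for variables, rhs, w in equations:
        slot = by_variables.setdefault(variables, [0, 0])
        slot[rhs] += w  # slot[b] accumulates the weight of "sum of variables = b"
    reduced = []
    for variables, (w0, w1) in by_variables.items():
        if w0 > w1:
            reduced.append((variables, 0, w0 - w1))
        elif w1 > w0:
            reduced.append((variables, 1, w1 - w0))
    return reduced

def doubled_excess(equations, assignment):
    """Twice (satisfied weight minus W/2); doubling keeps the value an integer."""
    total = sum(w for _, _, w in equations)
    hit = sum(w for variables, rhs, w in equations
              if sum(assignment[v] for v in variables) % 2 == rhs)
    return 2 * hit - total
```

The quantity computed by `doubled_excess` is the same before and after the reduction for every assignment, which is why the parameter k does not need to change.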

Proof of Theorem 2. It is convenient to view an equation e as a set var(e). We first find a maximal
independent (i.e. disjoint) collection $S_c \subseteq F$ of c-sets. More precisely, we treat each equation as a set of
variables, ignore those sets of cardinality less than c, and find a maximal disjoint set over the c-sets using
the standard greedy algorithm. All remaining equations in F now have at most $c - 1$ variables if we remove
all occurrences of variables in $var(S_c)$ from F.

Next, we pick another collection $S_{c-1} \subseteq F$ of sets with the property that, after we remove all variables in
$var(S_c)$ from F, $S_{c-1}$ forms a maximally independent collection of $(c-1)$-sets in the remaining set system.
In general, for $j = c - 2$ down to 1, once the variables in $var(S_c \cup \cdots \cup S_{j+1})$ have been removed from the
remaining equations, a maximal independent set of j-sets is chosen greedily, and we set $S_j$ to be a collection
of corresponding original sets in F (with the variables in $var(S_c \cup \cdots \cup S_{j+1})$ added back). We continue until
$S_1$, in which each set in the collection has exactly one variable after those in $var(S_c \cup \cdots \cup S_2)$ have been
removed. For convenience, let $S_{c+1} = var(S_{c+1}) = \emptyset$. By properties of maximal disjoint sets, we have:

Observation 1. For every $1 \le j \le c$, eliminating the variables appearing in $var(S_c \cup \cdots \cup S_{j+1})$ leaves at
most j variables in every equation of F.

Now, either (1) $w(S_j) < k$ for every $j = 1, \ldots, c$, or (2) there is a j such that $w(S_j) \ge k$.

Case (1) is easily handled: for every j, each equation in $S_j$ contains j variables which do not appear in
$S_c \cup \cdots \cup S_{j+1}$. Since the weights are positive integers, $|S_j| \le w(S_j) < k$. Hence,
$|var(F)| = |var(\bigcup_{i=1}^{c} S_i)| \le ck + (c-1)k + \cdots + k \le (c(c+1)/2) \cdot k$. By trying all
$O(2^{(c(c+1)/2)k})$ assignments to var(F), we can find an optimal assignment for F.

Case (2) is more delicate and is handled by the two claims below. We will show that in this case, F is
a yes-instance of the problem and we can efficiently recover a solution for it. Recall an equation $e \in F$ is
non-degenerate if there is no $e' \in F$ such that e = e' + 1 (mod 2). As mentioned earlier, we may assume
without loss of generality that every equation in F is non-degenerate.

Claim 1. For every $1 \le j \le c$, a random assignment satisfying all equations in $S_j$ will satisfy every
non-degenerate equation in $F - S_j$ with probability 1/2. Moreover, we can output such a random assignment in
polynomial time.

Proof. To prove the first part of the claim, it suffices to show that no equation $e \in F - S_j$ (or its negation
e + 1) can be expressed as a linear combination of one or more equations in $S_j$. Put another way, we will
show that every equation $e \in F - S_j$ is linearly independent of the equations in $S_j$.

Suppose there are equations $e_1, \ldots, e_m$ from $S_j$ such that their summation (modulo 2) results in a variable
subset that is equal to the set of variables in another equation $e \in F$. That is, viewing $e_1, \ldots, e_m$ and e as
indicator n-bit vectors (one bit for each of the n variables, omitting the constant terms in the equations),
we have $e = \sum_{i=1}^{m} e_i \pmod 2$. Recall that every equation in $S_j$ has j variables which do not appear in
$var(S_c \cup \cdots \cup S_{j+1})$, and every pair of equations in $S_j$ involves disjoint sets of variables, by construction.
Hence, if m > 1, then the equation e (composed of variables from $e_1, \ldots, e_m$) has more than j variables
which do not appear in $var(S_c \cup \cdots \cup S_{j+1})$, which is impossible by Observation 1. Therefore m = 1, and
every subset $\{e_1, \ldots, e_m\}$ of equations from $S_j$ whose modulo sum is the same as another equation $e \in F - S_j$
has cardinality 1. But then the equation e is degenerate, which is a contradiction to the non-degeneracy
assumption. Therefore no non-degenerate equation in F (or its negation) can be represented as a linear
combination of one or more equations from $S_j$.

Now, given that every non-degenerate equation $e \in F - S_j$ is linearly independent of the equations
in $S_j$, we claim that a random assignment that is consistent with the equations in $S_j$ will satisfy e with
probability 1/2. This is a simple consequence of linear algebra over $\mathbb{F}_2$. Put the system of equations $S_j$ in
the form $Ax = b$, where $A \in \mathbb{F}_2^{|S_j| \times n}$, $x \in \mathbb{F}_2^{n}$, and $b \in \mathbb{F}_2^{|S_j|}$. Let $e \in F - S_j$. Define $B_e \in \mathbb{F}_2^{(|S_j|+1) \times n}$
to be identical to A in its first $|S_j|$ rows, and in the last row, $B_e$ contains the indicator vector for the
variables of e. Define $c_e \in \mathbb{F}_2^{|S_j|+1}$ to be identical to b in its first $|S_j|$ components, and $c_e$ contains the constant
term of e in its last component. Saying that $e \in F - S_j$ is linearly independent of $S_j$ is equivalent to saying
$rowrank(B_e) = rowrank(A) + 1$, and the set of solutions to $Ax = b$ contains the set of solutions to $B_e x = c_e$.
The number of solutions to a consistent system of rank r is $2^{n-r}$. Therefore a uniform random variable assignment
that satisfies $Ax = b$ will also satisfy $B_e x = c_e$ with probability 1/2.

Finally, we describe how to produce a uniform random assignment over all assignments that satisfy the
equations in $S_j$. Produce a random assignment to the variables in $var(S_{c+1} \cup \cdots \cup S_{j+1})$, then produce a
random assignment to those variables in the maximal independent collection of j-sets obtained after removing
$var(S_{c+1} \cup \cdots \cup S_{j+1})$, in such a way that every equation in $S_j$ is satisfied. (Exactly one variable in each
equation of $S_j$ will be "forced" to be a certain value, but note that none of these forced variables appear
in more than one equation of $S_j$, by construction.) The remaining variables are set to 0 or 1 uniformly at
random. Note that if j = 1 and some equation $e \in S_1$ has $|var(e)| = 1$, the assignment to the variable of e
is decided uniquely. □

Claim 2. If there is a j with $w(S_j) \ge k$, then we can find an assignment with weight at least W/2 + k/2 in
polynomial time.

Proof. Suppose that $j \ge 1$ is the largest integer with $w(S_j) \ge k$. By Claim 1, a random assignment satisfying
all equations in $S_j$ will satisfy every other non-degenerate equation with probability 1/2. Hence the weight
of such an assignment is at least $(W - w(S_j))/2 + w(S_j) \ge W/2 + k/2$ on average. An assignment can also
be found deterministically using conditional expectation. □

This completes the proof of Theorem 2. □
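
The whole procedure (greedy layers, the case split, and a derandomized Claim 2) can be sketched in Python as follows. This is a sketch under stated assumptions, not the authors' implementation: an equation is a triple (frozenset of variables, right-hand side bit, weight), variables are comparable values such as integers, the instance is already reduced with pairwise distinct variable sets, and all function names are hypothetical. The derandomization substitutes one forced variable per equation of the chosen layer and then fixes the free variables by the method of conditional expectations.

```python
from itertools import product

def weight(eqs, a):
    """Total weight of the equations satisfied by assignment a (dict: variable -> 0/1)."""
    return sum(w for vs, rhs, w in eqs if sum(a[v] for v in vs) % 2 == rhs)

def greedy_layers(eqs, c):
    """Return {j: (indices of S_j, variables removed before stage j)} for j = c, ..., 1."""
    removed, layers = set(), {}
    for j in range(c, 0, -1):
        layer, taken = [], set()
        for idx, (vs, _, _) in enumerate(eqs):
            rest = vs - removed
            if len(rest) == j and not (rest & taken):  # greedy maximal disjoint family
                layer.append(idx)
                taken |= rest
        layers[j] = (layer, set(removed))
        removed |= taken
    return layers

def satisfy_layer(eqs, layer, removed_before):
    """Satisfy every equation of the layer; fix the other variables greedily."""
    # One forced variable per layer equation, chosen among its not-yet-removed variables.
    forced = {min(eqs[i][0] - removed_before): i for i in layer}
    in_layer, rest = set(layer), []
    for idx, (vs, rhs, w) in enumerate(eqs):
        if idx in in_layer:
            continue
        vs = set(vs)
        for f in [v for v in vs if v in forced]:
            evs, erhs, _ = eqs[forced[f]]
            vs ^= evs    # substitute f by the rest of its layer equation
            rhs ^= erhs
        rest.append((vs, rhs, w))

    def doubled_expectation(items):
        # an equation with free variables left counts w; a decided one counts 2w or 0
        return sum(w if vs else 2 * w * (rhs == 0) for vs, rhs, w in items)

    a = {}
    all_vars = set().union(*(vs for vs, _, _ in eqs))
    for x in sorted(all_vars - set(forced)):
        options = []
        for b in (0, 1):
            trial = [(vs - {x}, rhs ^ (b if x in vs else 0), w) for vs, rhs, w in rest]
            options.append((doubled_expectation(trial), b, trial))
        _, a[x], rest = max(options, key=lambda t: t[:2])
    for f, idx in forced.items():
        evs, erhs, _ = eqs[idx]
        a[f] = (erhs + sum(a[v] for v in evs if v != f)) % 2
    return a

def solve(eqs, c, k):
    """Decide whether some assignment has weight at least W/2 + k/2; also return one assignment."""
    W = sum(w for _, _, w in eqs)
    for j, (layer, removed_before) in greedy_layers(eqs, c).items():
        if sum(eqs[i][2] for i in layer) >= k:        # Case (2)
            return True, satisfy_layer(eqs, layer, removed_before)
    variables = sorted(set().union(*(vs for vs, _, _ in eqs)))  # Case (1): few variables
    best = max((dict(zip(variables, bits))
                for bits in product((0, 1), repeat=len(variables))),
               key=lambda a: weight(eqs, a))
    return 2 * weight(eqs, best) >= W + k, best
```

In Case (2) the returned assignment satisfies all of the chosen layer, and the greedy fixing never lets the conditional expectation of the remaining weight decrease, which is the guarantee used in Claim 2 for reduced instances.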

The above proof shows that the following stronger statement is also true. 

Theorem 3. For every $c \ge 2$, let I be an instance of Max-c-Lin-2 Above Average. If $|var(F)| \ge
(c(c+1)/2)k$, then I is a yes-instance and an assignment satisfying equations with at least W/2 + k/2
weight can be found in polynomial time.

Observe that the running time of our algorithm is optimal up to constant factors in the exponent, assuming 
the Exponential Time Hypothesis: 

Theorem 4. If Max-3-Lin-2 Above Average can be solved in $O(2^{\varepsilon k} 2^{\varepsilon m})$ time for every $\varepsilon > 0$, then
3SAT can be solved in $O(2^{\delta n})$ time for every $\delta > 0$, where n is the number of variables.

Proof. First, by the improved Sparsification Lemma of [4], for every $\delta > 0$ we can reduce 3SAT on n variables
and m clauses in $2^{\delta n}$ time to 3SAT on n variables and $m' = (1/\delta)^{c} n$ clauses, for some fixed constant c > 1.
This 3SAT instance on n variables and $m'$ clauses can further be reduced to Max-3-Lin-2 on n variables and
$O(m')$ equations using the reduction of Lemma 1 (proved below). Provided that we can determine whether
$m'/2 + k/2$ equations can be satisfied in $2^{\varepsilon k} 2^{\varepsilon m'}$ time, then by trying each k in the interval $[1, m']$ we can
solve the Max-3-Lin-2 instance exactly in at most $2^{2\varepsilon m'} \le O(2^{2\varepsilon (1/\delta)^{c} n})$ time.

This results in an $O(2^{\delta n + 2\varepsilon (1/\delta)^{c} n})$ algorithm for 3SAT. Setting $\varepsilon = \delta^{c+1}$, we obtain $O(2^{3\delta n})$ time. As this
reduction works for every $\delta > 0$, the conclusion follows. □
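
For reference, the exponent arithmetic behind the final running time can be written out explicitly. The display below only expands the substitution $\varepsilon = \delta^{c+1}$ stated in the proof and introduces no new symbols.

```latex
\[
  \delta n + 2\varepsilon \left(\tfrac{1}{\delta}\right)^{c} n
  \;=\; \delta n + 2\,\delta^{c+1}\,\delta^{-c}\, n
  \;=\; \delta n + 2\delta n
  \;=\; 3\delta n .
\]
```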

4 Boolean MAX-c-CSP Above Average 

To apply our algorithm to general CSPs, we use the following reduction. 

Lemma 1 ([1], [5]). There is a polynomial time reduction from Max-c-CSP Above Average with n variables
and parameter k to Max-c-Lin-2 Above Average with n variables and parameter k.

The proof of the lemma for unweighted c-CSP is sketched in [1], and a full proof is given in [5]. Here we
give an alternative proof which also covers the weighted case. (Although we are confident that the proofs
in [1], [5] also extend to the weighted case, we include a proof in Appendix A for completeness.)

Theorem 5. For every $c \ge 2$, Max-c-CSP Above Average can be solved in $O(2^{(c(c+1)/2)k} \cdot m)$ time.

Proof of Theorem 5. Using the reduction of Lemma 1, reduce an instance of Max-c-CSP Above
Average with m constraints to Max-c-Lin-2 Above Average with $O(2^{c} \cdot m)$ equations. Using the
algorithm of Theorem 2, we solve the obtained instance of Max-c-Lin-2 Above Average. Thus we can
determine if the given c-CSP has an assignment with weight at least AVG + k in $O(2^{(c(c+1)/2)k} \cdot 2^{c} \cdot m) =
O(2^{(c(c+1)/2)k} \cdot m)$ time. For finding an actual solution for Max-c-CSP Above Average, we can simply use
the transformation given in the proof of Lemma 1. □

Theorem 3 and Lemma 1 show in fact that every CSP admits a hybrid algorithm [20].

Reminder of Theorem 1. For every Boolean Max-c-CSP, there is an algorithm with the property that,
for every $\varepsilon > 0$, on any instance I, the algorithm outputs either:

• an optimal solution to I within $O^*(2^{c(c+1)\varepsilon m/2})$ time, or

• a $(p + \varepsilon/2^{c+1})$-approximation to I within polynomial time, where p is the expected fraction of weighted
constraints satisfied by a uniform random assignment to the CSP.

Proof. Given an instance I of Max-c-CSP with m constraints, Lemma 1 shows that we can reduce I to a
weighted instance $I'$ of Max-c-Lin-2 with $O(2^{c} m)$ equations, in polynomial time, such that at least $pm + \delta k$
constraints can be satisfied in I if and only if at least W/2 + k/2 weight of equations can be satisfied in $I'$,
where $\delta \ge 1/2^{c}$ depends on the underlying constraints. Now set $k = \varepsilon m$ and run the algorithm of
Theorem 3. If $|var(I')| \ge (c(c+1)/2)k$, then an assignment satisfying at least $W/2 + k/2 \ge W/2 + \varepsilon m/2$
weight of equations can be found in polynomial time, hence we obtain an assignment for I satisfying at least
$pm + \delta \varepsilon m/2$ constraints. Otherwise, exhaustive search over the at most $c(c+1)\varepsilon m/2$ variables of $I'$ will uncover an
exact solution to I in $O^*(2^{c(c+1)\varepsilon m/2})$ time. □

We close this section by showing how our algorithm provides linear size kernels for Max-c-Lin-2
Above Average and a kernel for Max-c-CSP Above Average.

Corollary 1. For every c > 3, the problem Max-c-Lin-2 Above Average can be reduced to a problem
kernel with at most $(c(c+1)/2)k$ variables in polynomial time.

Proof. Consider executing the algorithm of Theorem 2 up to the point before it performs an exhaustive
search of assignments. At this point, the algorithm has taken only polynomial time. If there is an $S_j$ with
weight at least k, the algorithm outputs an assignment with weight at least W/2 + k/2 in polynomial time.
Otherwise, for all $j = c, c - 1, \ldots, 1$, $S_j$ has weight less than k. It follows (from Case (1) in the proof of
Theorem 2) that the total number of variables in the instance is at most $(c(c+1)/2)k$. □

Note that the size of the kernel in Corollary 1 matches the prior work for c = 2 [1].

Corollary 2. For every c > 3, the problem Max-c-CSP Above Average can be reduced to a problem
kernel with at most $(c(c+1)/2)k$ variables in polynomial time.

Proof. In the proof of Theorem 1 in [1], a procedure V is given that reduces any instance of Max-c-Lin-2
Above Average with total sum of weights W and parameter k into an instance of Max-c-CSP Above
Average with (a multiset of) $2^{c-1}W$ constraints and parameter $2^{c-1}k$. (In [1], the transformed instance is
in fact a Max-c-SAT instance.) More precisely, the procedure
V considers an instance of Max-c-Lin-2 Above Average in which each equation has weight 1 and the
multiplicity of an equation may be larger than one. V maps an equation into a set of $2^{c-1}$ clauses.

Given a Max-c-CSP Above Average instance on n variables and m constraints, we first perform the
transformation given by Lemma 1 and obtain a Max-c-Lin-2 Above Average instance, with $O(2^{c} m)$
equations and n variables. By applying the kernelization of Corollary 1, we obtain an equivalent instance
with at most $(c(c+1)/2)k$ variables and no more than $(c(c+1)k/2)^{c}$ (weighted) equations. Finally, apply
procedure V to reduce the problem back into a Max-c-CSP Above Average instance, having $(c(c+1)/2)k$
variables, $O(2^{c-1} \cdot (c(c+1)k/2)^{c})$ constraints and parameter $2^{c-1}k$. □



5 MAX-c-Permutation CSP Above Average 

In Section 3, we proved that every reduced instance of Max-c-Lin-2 either has an assignment satisfying
W/2 + k weight of equations, or has at most (c(c + 1)/2)k variables. In this section, we describe how to
apply this result to the problems Max-c-Permutation CSP for c = 2, 3 and obtain a bikernel with O(k)
variables. By Proposition 1, it suffices for us to focus on the problem 3-Linear Ordering instead of
considering general Max-c-Permutation CSP. The outline of our proof is as follows. We first introduce
new reduction rules for simplifying 3-Linear Ordering instances. Then we reduce a given instance to another
intermediate problem called t-Ordering. Finally, we show how to reduce t-Ordering to Max-3t-Lin-2,
so we can apply our results from the previous section.

Recall that in 3-Linear Ordering, we are given a collection C of constraints over a variable set V
with weights $w_i$, sometimes denoted as $w(e_i)$, and an integer k > 0. A constraint $e_i$ is an ordered tuple
of size at most three, i.e., $e_i = (v_1, v_2, v_3)$ or $e_i = (v_1, v_2)$. Constraint $e_i$ is satisfied by a linear ordering
$\phi$ if $\phi(v_1) < \phi(v_2) < \phi(v_3)$ (or $\phi(v_1) < \phi(v_2)$, respectively). Let $w(\phi, C)$ be the total weight of
constraints satisfied by $\phi$. Our task is to decide if there is a linear ordering $\phi : V \to [n]$ of weight $w(\phi, C)$ at
least $p \cdot W + k$, where W is the total weight of constraints in C and p is the expected fraction of weighted
constraints satisfied by a uniform random assignment.
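The definition above can be made concrete with a minimal sketch. The function names and the list-of-pairs encoding of C are illustrative assumptions, not notation from the paper; the average uses the fact that a uniformly random ordering satisfies a constraint e with probability 1/|e|!.

```python
from fractions import Fraction
from math import factorial

def ordering_weight(constraints, position):
    """Total weight w(phi, C) of the constraints satisfied by a linear ordering.

    constraints: list of (variables, weight), where variables is a tuple of size 2 or 3.
    position: dict mapping every variable to its distinct position phi(v).
    """
    total = 0
    for variables, weight in constraints:
        ranks = [position[v] for v in variables]
        # e is satisfied when its variables occur in strictly increasing position.
        if all(a < b for a, b in zip(ranks, ranks[1:])):
            total += weight
    return total

def average_weight(constraints):
    """p * W: a uniformly random ordering satisfies e with probability 1/|e|!."""
    return sum(Fraction(weight, factorial(len(variables)))
               for variables, weight in constraints)

def beats_average_by(constraints, position, k):
    """True if the ordering has weight at least p * W + k."""
    return ordering_weight(constraints, position) >= average_weight(constraints) + k

# Hypothetical toy instance on three variables.
example = [(("a", "b", "c"), 3), (("c", "a"), 2), (("b", "c"), 1)]
phi = {"a": 1, "b": 2, "c": 3}
print(ordering_weight(example, phi), average_weight(example))
```

Exact rational arithmetic is used so that the comparison against p * W + k is not affected by rounding.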

For an ordering constraint $e = (v_1, v_2, v_3) \in C$, let $var(e) = \{v_1, v_2, v_3\}$ and let $e(i)$ denote the i-th
coordinate of e. Let $A(u, v, w)$ be the set of all six possible constraints on variables u, v, w. Let $v_1, v_2, v_3$ be
a triple of variables and suppose $A(v_1, v_2, v_3) \cap C \neq \emptyset$. Then $C(v_1, v_2, v_3)$ is defined as $A(v_1, v_2, v_3) \cap C$. Given
a constraint $e = (v_1, v_2, v_3)$, we abbreviate $A(v_1, v_2, v_3)$ and $C(v_1, v_2, v_3)$ as $A(e)$ and $C(e)$ respectively.

We say that two distinct constraints $e, e' \in A(v_1, v_2, v_3)$ are a reverse pair if $e' =
(e(3), e(2), e(1))$ or, equivalently, $e(2) = e'(2)$. We say that the reverse pair $e, e' \in A(v_1, v_2, v_3)$ is absent
in the instance $I = (V, C, k)$ if neither e nor e' appears in C.

We have the following reduction rules for 3-Linear Ordering, which were known in the literature and
whose correctness is obvious.

Redundancy Rule: Remove a variable v from V if it does not appear in any constraint. Remove a
constraint e from C if its weight is zero.

Merging Rule: If $e_1$ and $e_2$ are identical, then replace them by a single constraint $e_1$ of weight $w_1 + w_2$.

Cancellation Rule: If there are two constraints $e_1, e_2$ with $|e_1| = |e_2| = 2$ and $e_2 = (e_1(2), e_1(1))$, let
$w_{min} = \min\{w(e_1), w(e_2)\}$ and replace the weights by $w'(e_1) := w(e_1) - w_{min}$ and $w'(e_2) := w(e_2) - w_{min}$.
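The three rules above can be sketched in a few lines. This is only an illustration under assumed data structures (constraints as tuples of variable names, non-negative integer weights); the function name `reduce_basic` is hypothetical.

```python
from collections import defaultdict

def reduce_basic(variables, weighted_constraints):
    """Apply the Merging, Cancellation and Redundancy Rules once, in that order.

    weighted_constraints: list of (tuple_of_variables, weight) pairs.
    Returns (remaining variables, dict mapping constraint -> positive weight).
    """
    # Merging Rule: identical constraints collapse into one with summed weight.
    weight = defaultdict(int)
    for e, w in weighted_constraints:
        weight[e] += w

    # Cancellation Rule: two opposite binary constraints lose their common weight.
    for e in list(weight):
        if len(e) == 2:
            rev = (e[1], e[0])
            if rev in weight:
                w_min = min(weight[e], weight[rev])
                weight[e] -= w_min
                weight[rev] -= w_min

    # Redundancy Rule: drop zero-weight constraints, then variables that no longer occur.
    kept = {e: w for e, w in weight.items() if w > 0}
    used = {v for e in kept for v in e}
    return [v for v in variables if v in used], kept

vs, cs = reduce_basic(
    ["a", "b", "c", "d"],
    [(("a", "b"), 3), (("b", "a"), 1), (("a", "b"), 1), (("a", "b", "c"), 2), (("c", "a"), 0)],
)
```

After one pass none of the three rules applies again: all constraints are distinct, every opposite binary pair has at most one survivor, and no zero-weight constraint or unused variable remains.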

Here we introduce two new reduction rules.

Edge Replacement Rule: If $e_1, e_2, e_3$ are three constraints in C with $var(e_1) = var(e_2) = var(e_3)$ and
such that $e_2 = (e_1(2), e_1(1), e_1(3))$ and $e_3 = (e_1(1), e_1(3), e_1(2))$, then:

• replace the weight of each constraint by $w'(e_i) := w(e_i) - w_{min}$ for each i = 1, 2, 3, where
$w_{min} = \min\{w(e_1), w(e_2), w(e_3)\}$.

• add the binary ordering constraint $(e_1(1), e_1(3))$ of weight $w_{min}$.

Cycle Replacement Rule: If $e_1, e_2, e_3$ are three constraints in C with $var(e_1) = var(e_2) = var(e_3)$ and
such that $e_2 = (e_1(2), e_1(3), e_1(1))$ and $e_3 = (e_1(3), e_1(1), e_1(2))$, then:

• replace the weight of each constraint by $w'(e_i) := w(e_i) - w_{min}$ for each i = 1, 2, 3, where
$w_{min} = \min\{w(e_1), w(e_2), w(e_3)\}$.

• add the three binary ordering constraints $(e_1(1), e_1(2))$, $(e_1(2), e_1(3))$ and $(e_1(3), e_1(1))$, each of weight $w_{min}$.

Lemma 2. The Edge Replacement and Cycle Replacement Rules are sound.

Proof. Let $(V, C, k)$ be an instance of 3-Linear Ordering and let $(V, C', k)$ be the instance obtained by applying
one of the reduction rules. To see that the Edge Replacement Rule is sound, we observe that $w(\phi, C) = w(\phi, C')$
for every linear ordering $\phi$: the orderings satisfying the added binary constraint $(e_1(1), e_1(3))$ are exactly those
satisfying one of $e_1, e_2, e_3$, and the average weight is unchanged, so the claim follows. For the Cycle Replacement
Rule, $w(\phi, C) = w(\phi, C') - w_{min}$, as any ordering $\phi$ satisfies at least one of the three binary ordering constraints
added, and $\phi$ satisfies two of them in $C'$ if and only if it satisfies one of the three cyclic order constraints in C.
Therefore, there exists a $\phi$ such that $w(\phi, C) \ge p \cdot W + k$ if and only if $w(\phi, C') \ge (p \cdot W + w_{min}) + k$. It remains
to observe that the average weight increases by $w_{min}$ under the reduction, which completes the proof. □
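The soundness argument can be checked by brute force on a small case. The sketch below is an illustration only: the weights 5, 2, 4 and the variable names are arbitrary choices, and the "after" instances were written out by hand following the two rules with $w_{min} = 2$.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def weight(constraints, order):
    """Satisfied weight when the variables are listed left to right in `order`."""
    pos = {v: i for i, v in enumerate(order)}
    return sum(w for e, w in constraints
               if all(pos[a] < pos[b] for a, b in zip(e, e[1:])))

def average(constraints):
    """Expected satisfied weight of a uniformly random ordering."""
    return sum(Fraction(w, factorial(len(e))) for e, w in constraints)

def excess_profile(constraints, variables):
    """Map each ordering to (satisfied weight minus average weight)."""
    avg = average(constraints)
    return {order: weight(constraints, order) - avg
            for order in permutations(variables)}

a, b, c = "a", "b", "c"
# Edge Replacement: e1 = (a,b,c), e2 = (b,a,c), e3 = (a,c,b) yield the binary constraint (a,c).
edge_before = [((a, b, c), 5), ((b, a, c), 2), ((a, c, b), 4)]
edge_after = [((a, b, c), 3), ((a, c, b), 2), ((a, c), 2)]
# Cycle Replacement: e1 = (a,b,c), e2 = (b,c,a), e3 = (c,a,b) yield (a,b), (b,c), (c,a).
cycle_before = [((a, b, c), 5), ((b, c, a), 2), ((c, a, b), 4)]
cycle_after = [((a, b, c), 3), ((c, a, b), 2), ((a, b), 2), ((b, c), 2), ((c, a), 2)]

print(excess_profile(edge_before, "abc") == excess_profile(edge_after, "abc"))
print(excess_profile(cycle_before, "abc") == excess_profile(cycle_after, "abc"))
```

Comparing the excess over the average, rather than the raw weight, is what matters for the "above average" parameter: for the cycle rule both the weight of every ordering and the average shift by the same amount.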

Note that we can apply all the above reduction rules exhaustively in polynomial time. We say that an
instance $I = (V, C, k)$ is irreducible if none of the above five reduction rules can be applied. We are ready to
state the main result of this section.

Theorem 6. Let I = (V, C, k) be an irreducible instance of 3-Linear Ordering. If I is a no-instance (i.e.,
less than pW + k constraints in I can be simultaneously satisfied), then the number of variables in I is less
than 15k.

The rest of this section will be devoted to proving Theorem 6. We start with an observation about
irreducible instances.

Lemma 3. Let $I = (V, C, k)$ be an irreducible instance of 3-Linear Ordering. Then for every triple
$v_1, v_2, v_3$ with $C(v_1, v_2, v_3) \neq \emptyset$, there is an absent pair $e, e' \in A(v_1, v_2, v_3)$ in I.

Proof. Suppose that I has no absent pair in $A(v_1, v_2, v_3)$. Then for each $v_i$, there exists a constraint
$e_i \in C(v_1, v_2, v_3)$ with $e_i(2) = v_i$. Consider the set $\{e_1(1), e_2(1), e_3(1)\}$ consisting of the first variables of
$e_1$, $e_2$ and $e_3$. There are two possibilities: (a) $\{e_1(1), e_2(1), e_3(1)\} = \{v_1, v_2, v_3\}$ and
(b) $\{e_1(1), e_2(1), e_3(1)\} = \{v_1, v_2, v_3\} \setminus \{v_i\}$ for some i. One can verify that the Cycle Replacement Rule applies in case (a) and
the Edge Replacement Rule applies in case (b). □

We now reduce 3-Linear Ordering to another problem that maps variables to bit vectors. This representation
will be useful in the final reduction to Max-c-Lin-2. A t-ordering is a mapping $\phi_t : V \to \{1, -1\}^t$. The
vectors in $\{1, -1\}^t$ are totally ordered by lexicographic order, where $(+1) < (-1)$.² For example, we have
$(+1, +1, -1) < (+1, -1, +1)$. As the vectors in $\{1, -1\}^t$ are totally ordered, we interpret them as positions in
$2^t$ buckets. Any t-ordering $\phi_t$ can be extended into a linear ordering $\phi : V \to \mathbb{N}$ which preserves the bucket
ordering imposed by $\phi_t$, so that $\phi(v_i) < \phi(v_j)$ whenever $\phi_t(v_i) < \phi_t(v_j)$. We say $\phi$ is an extension of $\phi_t$ if it
preserves the bucket ordering imposed by $\phi_t$.

The problem t-Ordering is to find (or determine that there is no) t-ordering $\phi_t : V \to \{1, -1\}^t$ with
weight $w_t(\phi_t, C)$ at least $p_t \cdot W + k$, where W is the total weight of all constraints and $p_t$ is the expected
fraction of weighted constraints satisfied by a uniform random assignment, i.e., $p_t \cdot W := E[w_t(\phi_t, C)]$.

We can view an instance (V, C, k) of 3-Linear Ordering as an instance of t-Ordering by introducing
a new payoff function. For each constraint $e = (v_1, v_2, v_3)$, a payoff function $g_e : (\{1, -1\}^t)^3 \to [0, 1]$ maps
$x = (x_1, \ldots, x_{3t}) \in (\{1, -1\}^t)^3$, a t-ordering $\phi_t$ restricted to the tuple e, to the probability that its random
extension satisfies e. The weight of e remains the same. Note that $g_e(x)$ is nonzero if and only if the three
vectors $(x_1, \ldots, x_t)$, $(x_{t+1}, \ldots, x_{2t})$ and $(x_{2t+1}, \ldots, x_{3t})$ are lexicographically non-decreasing. By $w_t(\phi_t, C)$,
we refer to the value $\sum_{e \in C} w_e \cdot g_e(\phi_t)$.
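As an illustration of the payoff function, the following sketch computes $g_e(x)$ by enumerating the orderings of the variables of e that respect the bucket order. It assumes that a "random extension" is uniform over all extensions, so that its restriction to the variables of e is uniform over the consistent orderings; the helper names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations

def bucket_key(vec):
    """Sort key realizing the lexicographic order with +1 < -1."""
    return tuple(0 if s == 1 else 1 for s in vec)

def payoff(x, arity, t):
    """g_e(x) for e = (v_1, ..., v_arity), where x concatenates the bucket vectors.

    Returns the fraction of bucket-respecting orderings of e's variables
    in which v_1 < v_2 < ... < v_arity.
    """
    buckets = [bucket_key(x[i * t:(i + 1) * t]) for i in range(arity)]
    good = total = 0
    for order in permutations(range(arity)):  # order[r] = index of the variable at rank r
        # Skip orderings that place a later bucket before an earlier one.
        if any(buckets[order[r]] > buckets[order[r + 1]] for r in range(arity - 1)):
            continue
        total += 1
        if order == tuple(range(arity)):
            good += 1
    return Fraction(good, total)

# The ordering example from the text: (+1, +1, -1) < (+1, -1, +1).
print(bucket_key((1, 1, -1)) < bucket_key((1, -1, 1)))
print(payoff((1, 1, 1, 1, 1, -1), arity=3, t=2))
```

At least one ordering always respects the buckets, so the division is well defined.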

2 The reason for this "funny" ordering is that we want true to correspond to −1, and false to correspond to 1, so that the
parity function can be represented by multiplication, and the empty set of variables will correspond to false (i.e., even parity).

As the function $g_e$ is defined on $x \in \{1, -1\}^{3t}$, it can be uniquely represented as a multilinear polynomial

$$g_e(x) = \sum_{S \subseteq [3t]} \hat{g}_e(S) \chi_S(x).$$

Consequently, the weight of a t-ordering $\phi_t$ is expressed by the polynomial $g(x) = \sum_{e \in C} w_e \cdot g_e(x)$ on
$x \in \{1, -1\}^{tn}$. In other words, for every t-ordering $\phi_t$ we have $g(x) = w_t(\phi_t, C)$. Here $(x_{(i-1)t+1}, \ldots, x_{it}) :=
\phi_t(v_i)$ for each $i \in [n]$. The polynomial $g(x)$ can be rewritten as follows:

$$g(x) = \sum_{e \in C} w_e \cdot g_e(x) = \sum_{e \in C} w_e \sum_{S \subseteq [3t]} \hat{g}_e(S) \chi_S(x)
= \sum_{S \subseteq [nt], |S| \le 3t} \Big( \sum_{e \in C} w_e \hat{g}_e(S) \Big) \chi_S(x)
= \sum_{S \subseteq [nt], |S| \le 3t} \hat{g}(S) \chi_S(x). \qquad (1)$$

The following observation is folklore. 

Observation 2. Let (V, C, k) be an instance of 3-Linear Ordering. Then $p = p_t$. In particular, if every
constraint $e \in C$ has arity exactly 3, we have $p = p_t = 1/6$.

Proof. Let $g_e(x)$ be the payoff function for constraint e in the associated t-Ordering instance. We have
$\sum_{e' \in A(e)} E[g_{e'}(\phi_t)] = 1$ since we satisfy exactly one constraint from $A(e)$ regardless of $\phi_t$. Moreover, we
know $E[g_{e'}(\phi_t)]$ takes the same value for all $e' \in A(e)$. As $|A(e)| = |e|!$, it follows that $E[g_e(\phi_t)] = 1/|e|!$.
Observe $p_t \cdot W = E[g(x)] = \sum_{e \in C} w_e E[g_e(x)] = \sum_{e \in C} w_e \cdot \frac{1}{|e|!} = p \cdot W$. This completes the proof. □

By an averaging argument, we observe the following.

Observation 3. Let (V, C, k) be an instance of 3-Linear Ordering. For any t-ordering $\phi_t$, we have
$w_t(\phi_t, C) \le \max_{\phi} w(\phi, C)$, where $\phi$ is taken over all extensions of $\phi_t$.

An easy consequence is:

Observation 4. Let (V, C, k) be an instance of 3-Linear Ordering. If it has a t-ordering $\phi_t$ with
$w_t(\phi_t, C) \ge p \cdot W + k$, then there is a linear ordering $\phi$ with $w(\phi, C) \ge p \cdot W + k$.

Observation 5. For a constraint e of arity c, the Fourier coefficients of $g_e$ are of the form $j/(c!\,2^{ct})$ with integral
$j \in [-c!\,2^{ct}, c!\,2^{ct}]$. Accordingly, the same condition holds for the Fourier coefficients of g.

Proof. Given two vectors u and v, denote their concatenation $u \circ v$ by the vector w where $w_i = u_i$ for all
$i = 1, \ldots, |u|$ and $w_{|u|+j} = v_j$ for all $j = 1, \ldots, |v|$. It is not difficult to see that $g_e(x) = 1/(n_1! \cdots n_{2^t}!)$
if

$$x = (+1, +1, \ldots, +1)^{n_1} \circ (+1, \ldots, +1, -1)^{n_2} \circ \cdots \circ (-1, -1, \ldots, -1)^{n_{2^t}},$$

and $g_e(x)$ takes the value 0 otherwise. Here $n_1 + \cdots + n_{2^t} = c$. Hence, $g_e(x) = j/c!$ for some integer $j \in [0, c!]$. The
claim follows from the inverse Fourier transform $\hat{g}_e(S) = E_{x \in \{1,-1\}^{ct}}[g_e(x)\chi_S(x)]$ and the fact that the
weight of a constraint e is integral. □
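The closed form in the proof and the integrality claim can be checked numerically for small c and t. The following is a sketch under stated assumptions: coordinates are 0-based, $\chi_S(x)$ is taken to be the product of the coordinates in S, and the function names are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations, product
from math import factorial

def payoff_closed_form(x, arity, t):
    """g_e(x) = 1/(n_1! * ... * n_{2^t}!) if the bucket vectors are
    lexicographically non-decreasing (with +1 < -1), and 0 otherwise."""
    keys = [tuple(0 if s == 1 else 1 for s in x[i * t:(i + 1) * t])
            for i in range(arity)]
    if any(keys[i] > keys[i + 1] for i in range(arity - 1)):
        return Fraction(0)
    denom = 1
    for multiplicity in Counter(keys).values():  # n_b = number of variables in bucket b
        denom *= factorial(multiplicity)
    return Fraction(1, denom)

def all_fourier_coefficients(arity, t):
    """Return {S: E_x[g_e(x) * chi_S(x)]} for every subset S of coordinates."""
    m = arity * t
    points = list(product((1, -1), repeat=m))
    values = [payoff_closed_form(x, arity, t) for x in points]
    coeffs = {}
    for size in range(m + 1):
        for S in combinations(range(m), size):
            acc = Fraction(0)
            for x, g in zip(points, values):
                chi = 1
                for i in S:
                    chi *= x[i]
                acc += g * chi
            coeffs[S] = acc / len(points)
    return coeffs

coeffs = all_fourier_coefficients(arity=3, t=2)
scale = factorial(3) * 2 ** 6  # c! * 2^{ct} for c = 3, t = 2
print(all((value * scale).denominator == 1 for value in coeffs.values()))
```

The constant coefficient `coeffs[()]` is the expectation of $g_e$, which Observation 2 states to be 1/|e|!.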

Recall that every monomial in the polynomial $g(x)$ has degree at most 3t. Therefore, the polynomial can
be interpreted as an instance of Max-3t-Lin-2 Above Average. Indeed, associate each monomial on the
subset $S \subseteq [tn]$ in $g(x)$ with a linear equation $\sum_{i \in S} y_i = 0$ (over $F_2$, with $x_i = (-1)^{y_i}$) of weight
$3!\,2^{3t} \cdot \hat{g}(S)$ if $\hat{g}(S)$ is positive, and an equation $\sum_{i \in S} y_i = 1$ of weight $-3!\,2^{3t} \cdot \hat{g}(S)$ if $\hat{g}(S)$ is
negative. (Here we multiply by the factor of $3!\,2^{3t}$ to make the weights integral.) Let F(C) be the instance of
Max-3t-Lin-2 Above Average constructed from an instance C of t-Ordering with arity at most three. The next
lemma claims that F(C) is equivalent to the t-Ordering instance. We omit the proof, which is essentially the
same as that of Lemma 1.

Lemma 4. Let (V, C, k) be an instance of t-Ordering. There exists a t-ordering $\phi_t$ with $w(\phi_t) \ge p \cdot W +
k/(c!\,2^{tc})$ if and only if the corresponding instance F(C) of Max-c-Lin-2 Above Average has an assignment
of weight at least $W_{F(C)}/2 + k/2$, where $W_{F(C)}$ is the sum of weights of equations in F(C).

Let F(C) be an instance of Max-3t-Lin-2 Above Average obtained from a t-Ordering instance (V, C, k).
We can apply Theorem 3 from the previous section, and either decide that F(C) is a yes-instance or that
F(C) has at most (3t(3t + 1)/2)k variables in polynomial time. By Lemma 4, the t-Ordering instance is
yes in the first case and thus the 3-Linear Ordering instance is yes as well by Observation 4. The second
outcome, however, does not necessarily imply a kernel for the 3-Linear Ordering instance (V, C, k).

To illustrate, let us consider Max Acyclic Subgraph. Let us take t = 1 and consider the associated
t-Ordering instance. For an edge constraint $e = (v_1, v_2)$, we have the payoff function $g_e(x_1, x_2) = 1/2 +
x_1/4 - x_2/4$ for t-ordering, where $x_i = \phi_t(v_i)$. Consider the instance given by a directed cycle $C_n = v_1, \ldots, v_n$,
in which the oriented edges of $C_n$ form the constraints. Every vertex is the tail of one edge and the head of
another, so the linear terms cancel and $w_t(\phi_t, C_n) = n/2$ for any $\phi_t \in \{1, -1\}^n$. Hence the corresponding Max-2-Lin-2
Above Average instance $F(C_n)$ is a no-instance and the number of variables is bounded (in fact, it has
no variables). But there is a linear ordering $\phi$ with $w(\phi, C_n) = n - 1$, and thus $C_n$ is a yes-instance for any
$k \le (n - 2)/2$. Moreover, the instance $C_n$ is irreducible. Hence the gap between the number of variables in
a no-instance $F(C_n)$ and $|V|$ in the 3-Linear Ordering instance can be arbitrarily large.
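The cancellation in this example can be reproduced directly. The sketch below assumes unit weights, vertices numbered from 0, and n = 5; it computes the Fourier coefficients of g for t = 1 by brute force and compares the directed cycle with a directed path, for which the endpoints do not cancel.

```python
from fractions import Fraction
from itertools import combinations, permutations, product

def edge_payoff(xu, xv):
    """t = 1 payoff of the constraint (u, v); buckets are ordered +1 < -1."""
    if xu == xv:
        return Fraction(1, 2)  # same bucket: a random extension succeeds half the time
    return Fraction(1) if xu == 1 else Fraction(0)

def fourier_coefficients(n, edges):
    """Nonzero coefficients E_x[g(x) * chi_S(x)] for |S| <= 2."""
    points = list(product((1, -1), repeat=n))
    coeffs = {}
    for size in range(3):
        for S in combinations(range(n), size):
            acc = Fraction(0)
            for x in points:
                chi = 1
                for i in S:
                    chi *= x[i]
                acc += chi * sum(edge_payoff(x[u], x[v]) for u, v in edges)
            if acc != 0:
                coeffs[S] = acc / len(points)
    return coeffs

def best_ordering_weight(n, edges):
    """Maximum number of edges (u, v) with u placed before v, by exhaustive search."""
    return max(sum(pos[u] < pos[v] for u, v in edges)
               for pos in permutations(range(n)))

n = 5
cycle = [(i, (i + 1) % n) for i in range(n)]
path = cycle[:-1]
print(fourier_coefficients(n, cycle))
print(best_ordering_weight(n, cycle))
```

For the cycle only the constant term survives, so F(C_n) has no variables, while the best linear ordering still beats the average.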

In the example above, every variable in $V = \{v_1, \ldots, v_n\}$ for $C_n$ disappears in the instance F(C) when
t = 1. We show that t = 2 is enough to ensure that every variable of V is "represented" in F(C).

Let (V, C, k) be an instance of 3-Linear Ordering and t = 2. In the polynomial $g(x)$, we construe the
variables $x_{2i-1}, x_{2i}$ as representing the position of the variable $v_i$ in a t-ordering $\phi_t$, i.e., $\phi_t(v_i) = (x_{2i-1}, x_{2i})$.
We say that a variable $v_i \in V$ is represented in the associated t-Ordering instance if $x_{2i-1}$ or $x_{2i}$ appears
in the instance F(C). We also say that a variable $v_i \in V$ is represented in a monomial $\hat{g}(S)\chi_S(x)$ if
$S \cap \{2i - 1, 2i\} \neq \emptyset$ and $\hat{g}(S) \neq 0$. Obviously, $v_i$ is represented in the associated t-Ordering instance if and
only if there is a monomial in $g(x)$ representing $v_i$.

An important observation is that a monomial of $g_e(x)$ which represents every variable of $var(e)$ for a
constraint e cannot be canceled out in $g(x)$ by a monomial generated by a constraint e' with $|e| = |e'|$ and
$var(e') \neq var(e)$. Therefore, we shall study the sum $\sum_{e' \in C(e)} w_{e'} \cdot g_{e'}(x)$ and prove that there is a single
monomial in the sum representing every variable of $var(e)$.

Lemma 5. Let I = (V, C, k) be an irreducible instance and t = 2. For a constraint $e = (v_1, v_2) \in C$, the
Fourier coefficient $\hat{g}_e(S)$ is nonzero for S = {1, 3, 4}.

Proof. Given two vectors u and v, denote their concatenation $u \circ v$ by the vector w where $w_i = u_i$ for all
$i = 1, \ldots, |u|$, and $w_{|u|+j} = v_j$ for all $j = 1, \ldots, |v|$. We recall that $g_e(x)$ is nonzero only if $(x_1, x_2) \le (x_3, x_4)$.
By the inverse Fourier transform, we have

$$\hat{g}_e(S) = E[g_e(x)\chi_S(x)] = \frac{1}{2^4} \Big( \sum_{x = x_1^2} g_e(x)\chi_S(x) + \sum_{x = x_1 \circ x_2} g_e(x)\chi_S(x) \Big),$$

where $x_i \in \{1, -1\}^2$. If x is of the form $x_1^2$ as in the first sum, we have $\chi_S(x) = x_1 x_3 x_4 = x_4$ and the sum
over such vectors equals zero. If $x = x_1 \circ x_2$ and $x_1 < x_2$, then it is tedious to check that the sum equals -2.
The claim follows. □

Lemma 6. Let I = (V, C, k) be an irreducible instance and t = 2. For a constraint $e = (v_1, v_2, v_3) \in C$, the
Fourier coefficient $\hat{g}_e(S)$ is negative for S = {1, 3, 5, 6} and S = {1, 2, 3, 5}.

Proof. Recall that we denote the concatenation of two vectors u and v by the vector $w = u \circ v$, where $w_i = u_i$ for all
$i = 1, \ldots, |u|$, and $w_{|u|+j} = v_j$ for all $j = 1, \ldots, |v|$.

Consider S = {1, 3, 5, 6}. The case when S = {1, 2, 3, 5} is symmetric. We have

$$\hat{g}_e(S) = \frac{1}{2^6} \Big( \sum_{x = x_1^3} g_e(x)\chi_S(x) + \sum_{x = x_1 \circ x_2^2} g_e(x)\chi_S(x) + \sum_{x = x_1^2 \circ x_2} g_e(x)\chi_S(x) + \sum_{x = x_1 \circ x_2 \circ x_3} g_e(x)\chi_S(x) \Big),$$

where $x_i \in \{1, -1\}^2$ and the sum is taken over all $x = x_1 \circ x_2 \circ x_3$ such that $x_1 \le x_2 \le x_3$. If $x = x_1^3$ or x is of the
form $x_1^2 \circ x_2$, we have $\chi_S(x) = x_1 x_3 x_5 x_6 = x_5 x_6$ and the sum over such vectors equals zero. If $x = x_1 \circ x_2^2$,
we have $\chi_S(x) = x_1 x_6$, and it is tedious to check that the sum equals zero. If x is of the form $x_1 \circ x_2 \circ x_3$, observe
that the sum equals -2. The claim follows. □
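The "tedious to check" sums in the two proofs above can be delegated to a short brute-force computation. This is a sketch under the same assumptions as the definitions in this section (t = 2, the order +1 < -1, a uniformly random extension, and $\chi_S$ the product of the coordinates in S); the helper names are hypothetical and S uses the 1-based coordinates of the text.

```python
from fractions import Fraction
from itertools import permutations, product

T = 2  # number of coordinates per variable

def payoff(x, arity):
    """g_e(x): fraction of bucket-respecting orderings satisfying e = (v_1, ..., v_arity)."""
    buckets = [tuple(0 if s == 1 else 1 for s in x[i * T:(i + 1) * T])
               for i in range(arity)]
    good = total = 0
    for order in permutations(range(arity)):
        if any(buckets[order[r]] > buckets[order[r + 1]] for r in range(arity - 1)):
            continue  # this ordering violates the bucket order
        total += 1
        good += order == tuple(range(arity))
    return Fraction(good, total)

def weighted_sum(arity, S):
    """Sum over all x of g_e(x) * chi_S(x); dividing by 2^(arity*T) gives the coefficient."""
    acc = Fraction(0)
    for x in product((1, -1), repeat=arity * T):
        chi = 1
        for i in S:
            chi *= x[i - 1]
        acc += payoff(x, arity) * chi
    return acc

def fourier_coefficient(arity, S):
    return weighted_sum(arity, S) / 2 ** (arity * T)

print(weighted_sum(2, {1, 3, 4}), fourier_coefficient(2, {1, 3, 4}))
print(weighted_sum(3, {1, 3, 5, 6}), fourier_coefficient(3, {1, 3, 5, 6}))
```

The unnormalized sums are the quantities the proofs refer to; the coefficients themselves are those sums divided by 16 and 64 respectively.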

Lemma 7. Let I = (V, C, k) be an irreducible instance and t = 2. Then every variable $v \in V$ is represented
in the associated t-Ordering instance.

Proof. Suppose $v \in V$ appears in some ternary constraint $e \in C$. Consider the sum $\sum_{e' \in C(e)} w_{e'} \cdot g_{e'}(x)$. By
Lemma 3, we know that there is an absent pair $e_1, e_2$ such that $var(e_1) = var(e_2) = var(e)$ and $e_1, e_2 \notin C(e)$.
Let $e_1 = (v_1, v_2, v_3)$. We take S = {1, 3, 4, 5}. Then by Lemma 6 the Fourier coefficient $\hat{g}_{e'}(S)$ is negative
for every $e' \in C(e)$. Observe that the monomial corresponding to S in the sum $\sum_{e' \in C(e)} w_{e'} \cdot g_{e'}(x)$ represents all
variables in $var(e)$. Moreover, this monomial is not canceled out in $g(x)$. Therefore, we conclude that v is
represented by the associated t-Ordering instance.

If $v \in V$ does not appear in any ternary constraint, then there is a binary constraint $e \in C$ in which v
appears, since the instance is reduced with respect to the Redundancy Rule. Let $e = (v_1, v_2)$. As the instance is
reduced with respect to the Cancellation Rule, $(v_2, v_1) \notin C$. Now Lemma 5 implies that v is represented by the
associated t-Ordering instance. □



Proof of Theorem 6. Let I = (V, C, k) be an irreducible instance of 3-Linear Ordering and assume I is
a no-instance. By Observation 4, the associated t-Ordering instance is no. By Lemma 4 and Theorem 2,
the instance F(C) of Max-6-Lin-2 Above Average has less than 15k variables. Finally, Lemma 7 says
that every variable in V is represented in the associated t-Ordering instance and thus |V| < 15k. □

The problem kernel for 2-Linear Ordering follows immediately.

Corollary 3. Let I = (V, C, k) be an irreducible instance of 2-Linear Ordering. If I is a no-instance, we
have |V| < 10k.

Proof. The proof remains the same as the proof of Theorem 6, except that the instance F(C) of Max-4-Lin-2
Above Average has less than 10k variables. □

Combined with an exact $O^*(2^n)$-algorithm,³ our result in this section leads to the following result.

Theorem 7. The problem Max-c-Permutation CSP Above Average can be solved in time $2^{O(k)}$ for
c = 2, 3.

Proof. Using Proposition 1, Max-c-Permutation CSP Above Average instances can be transformed
into equivalent instances of c-Linear Ordering. By Theorem 6 and Corollary 3,
we either know the instance is a yes-instance or obtain an equivalent instance with less than 15k variables for
c = 3 (or an instance with less than 10k variables for c = 2). Hence, an $O^*(2^n)$-algorithm to exactly compute
the maximum number of satisfiable constraints on an n-variable instance of Exact 3-Linear Ordering will
yield the desired result. Here, an instance I of Exact 3-Linear Ordering is given as a pair (V, C) and the
task is to find a linear ordering on V so as to maximize the number of satisfied constraints in C. We give an
exposition of such an algorithm for c = 3. An analogous observation applies to c = 2.

3 The notation O* suppresses polylogarithmic factors.

Bodlaender et al. [5] present algorithms for Vertex Ordering problems which run in $O^*(2^n)$ time and
$O^*(2^n)$ space, or $O^*(4^n)$ time and polynomial space. Let V be a set of elements, which may be vertices in
graph problems or variables in our permutation CSP context. For a linear ordering $\pi$ on V, we denote the
set $\{w \in V : \pi(w) < \pi(v)\}$ by $\pi_{<,v}$. Consider a function f from the domain of triples (G, S, v) to an integer,
where G is a graph, $S \subseteq V(G)$ and $v \in V(G)$. It is shown in [5] that if f is polynomial time computable,
the value

$$\max_{\pi} \sum_{v \in V} f(G, \pi_{<,v}, v),$$

where $\pi$ is taken over all possible linear orderings, can be computed either in $O^*(2^n)$ time and $O^*(2^n)$
space, or in $O^*(4^n)$ time and polynomial space. Alert readers might notice that the former uses dynamic
programming in Held-Karp style and the latter employs recursion instead. We point out that although
they describe the algorithms in the context of graph problems, the validity does not depend on whether the
relations on V are of arity two or not. It remains to formulate the Exact 3-Linear Ordering problem to
fit in the setting.

We take $f(I, S, v) := |\{e = (a, v, c) \in C : a \in S, c \in V \setminus (S \cup \{v\})\}|$ and note that f can be computed
in polynomial time given a triple I = (V, C), $S \subseteq V$ and $v \in V$. To see that
$\max_{\pi} \sum_{v \in V} f(I, \pi_{<,v}, v)$ equals the optimal value of the Exact 3-Linear Ordering instance I, it suffices
to observe the following: given a linear ordering $\pi$, (a) the family $\{e = (a, v, c) \in C\}$, $v \in V$, partitions the
constraint set C, and (b) a constraint $e = (a, v, c) \in C$ contributes one to $f(I, \pi_{<,v}, v)$ if and only if $\pi$ satisfies
e. Finally, we note that the extension of the formulation to weighted instances and instances with (some)
constraints of arity two is straightforward. □
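The exponential-space variant can be sketched as a dynamic program over subsets. This is an illustrative implementation of the Held-Karp style recurrence with the function f defined above, not the code of [5]; it handles unweighted ternary constraints only, and the bitmask encoding and function names are assumptions of this sketch. A brute-force routine is included for cross-checking on small inputs.

```python
from itertools import permutations

def max_satisfied(variables, constraints):
    """Maximum number of constraints (a, v, c) satisfied by some linear ordering.

    best[mask] is the largest value of sum_v f(prefix before v, v) over all
    ways of arranging the variables in `mask` as the first |mask| positions.
    """
    n = len(variables)
    index = {v: i for i, v in enumerate(variables)}
    triples = [tuple(index[v] for v in e) for e in constraints]

    def f(prefix_mask, v):
        # Constraints with middle element v, first element already placed,
        # and last element not yet placed.
        count = 0
        for a, mid, c in triples:
            if mid == v and (prefix_mask >> a) & 1 and not (prefix_mask >> c) & 1:
                count += 1
        return count

    best = [0] * (1 << n)
    for mask in range(1, 1 << n):
        candidates = []
        for v in range(n):
            if (mask >> v) & 1:
                rest = mask & ~(1 << v)  # variables placed before v
                candidates.append(best[rest] + f(rest, v))
        best[mask] = max(candidates)
    return best[(1 << n) - 1]

def max_satisfied_brute_force(variables, constraints):
    """Reference answer by trying all n! orderings."""
    best = 0
    for order in permutations(variables):
        pos = {v: i for i, v in enumerate(order)}
        best = max(best, sum(pos[a] < pos[b] < pos[c] for a, b, c in constraints))
    return best

V = ["a", "b", "c", "d"]
C = [("a", "b", "c"), ("b", "c", "d"), ("d", "a", "b"), ("c", "b", "a")]
print(max_satisfied(V, C), max_satisfied_brute_force(V, C))
```

The table has $2^n$ entries and each entry is filled in time polynomial in n and |C|, which matches the $O^*(2^n)$ time and space bound quoted in the text.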

Closing this section, we point out that the recent work in [TU], independently of our paper, also explores 
the idea of ensuring a monomial which represents a variable in the multilinear polynomial. They consider 
the c-Linear ordering problem in which every variable occurs in a bounded number of constraints and 
show that approximation beyond the random assignment threshold 1/c! is achievable. It is interesting to 
note as well that their motivation is to gain over the random assignment threshold. 

Acknowledgement The authors would like to thank Daniel Goncalves for valuable discussion which
inspired the results of Section 5.

A Proof of Lemma 1

Proof of Lemma 1. Let $F = \{(f_1, S_1, w_1), \ldots, (f_m, S_m, w_m)\}$ be a given set of constraints and let AVG be
the expected weight in F of a uniform random variable assignment. (For definitions, see the Preliminaries.)
First, convert each constraint function $f_i : \{0, 1\}^{|S_i|} \to \{0, 1\}$ into the form $f_i' : \{-1, 1\}^{|S_i|} \to \{0, 1\}$, replacing
all 0's with 1's and all 1's with -1's. This can be easily done via the linear transformation $\ell(x) = 1 - 2x$.
Let F' be the new set of constraints.

Let $s_i = |S_i|$. A well-known fact is that for any function $f_i' : \{-1, 1\}^{s_i} \to \{0, 1\}$ there is always a unique
multilinear polynomial $p_i(x_1, \ldots, x_{s_i})$ such that for all assignments $a \in \{-1, 1\}^{s_i}$, $p_i(a) = f_i'(a)$. This
polynomial has the form

$$p_i(x) = \sum_{T \subseteq S_i} \alpha^{(i)}_T \prod_{j \in T} x_j,$$

where every $\alpha^{(i)}_T = j/2^{s_i}$ for some integer $j \in [-2^{s_i}, 2^{s_i}]$. Define $q_i(x) = p_i(x) - \alpha^{(i)}_{\emptyset}$. That is, $q_i$ equals $p_i$
minus the constant coefficient of $p_i$.

We claim that for all assignments $a = (a_1, \ldots, a_n) \in \{0, 1\}^n$, a has weight at least AVG + k in F if and
only if $\sum_{i=1}^{m} w_i \cdot q_i(\ell(a_1), \ldots, \ell(a_n)) \ge k$. To see this, observe that the sum of all constant coefficients of $p_i$
multiplied by the weight $w_i$ is exactly AVG, since this sum is

$$\sum_{i=1}^{m} w_i \cdot \alpha^{(i)}_{\emptyset} = \sum_{i=1}^{m} w_i \cdot E_{x \in \{-1,1\}^n}[p_i(x) - q_i(x)] = AVG - \sum_{i=1}^{m} w_i \cdot E_{x \in \{-1,1\}^n}[q_i(x)]$$
$$= AVG - \sum_{i=1}^{m} w_i \cdot \sum_{T \neq \emptyset} \alpha^{(i)}_T \, E_{x \in \{-1,1\}^n}\Big[\prod_{j \in T} x_j\Big] = AVG,$$

where the last equality follows because any nontrivial product of random variables over {-1, 1} has
expectation zero. Hence a {0, 1}-assignment with weight w in F translates directly to a {-1, 1}-assignment
that makes $\sum_i w_i \cdot q_i = w - AVG$.

Now we reduce the problem of finding a {-1, 1}-assignment such that $\sum_i w_i \cdot q_i \ge k/2^c$ to finding a {0, 1}-assignment
to a Max-c-Lin-2 instance with weight at least W/2 + k/2. Let $r(x_1, \ldots, x_n) = \sum_i w_i \cdot q_i(x_1, \ldots, x_n)$.

Associate each monomial $\prod_{j \in T} x_j$ in $r(x)$ with a linear equation $e(T) = 0$ of weight $2^c \cdot \sum_{i=1}^{m} w_i \cdot \alpha^{(i)}_T$, if this quantity
is positive. If this quantity is negative, then associate it with the equation $e(T) = 1$ of weight $-2^c \cdot \sum_{i=1}^{m} w_i \cdot \alpha^{(i)}_T$.
Here $e(T) = \sum_{j \in T} y_j$. Notice that we need the $2^c$ factor in the weights in order to make them integral. For
any {-1, 1}-assignment x, take $y_i = \ell^{-1}(x_i)$ and note that $x_i = (-1)^{y_i}$. Hence

$$2^c \cdot \sum_{i=1}^{m} w_i \cdot \alpha^{(i)}_T \Big( \prod_{j \in T} x_j \Big) = 2^c \cdot \sum_{i=1}^{m} w_i \cdot \alpha^{(i)}_T \, (-1)^{e(T)}$$

equals the weight of the corresponding equation e(T) if y satisfies it, and equals the negative of the weight
of e(T) if y falsifies it. Lastly, it remains to observe that any {0, 1}-assignment y to a Max-c-Lin-2 instance
has weight at least W/2 + k/2 if and only if the weight of satisfied equations minus the weight of falsified
equations by y is at least k. This shows that if there exists a {-1, 1}-assignment such that $\sum_i w_i \cdot q_i \ge k/2^c$,
there is a {0, 1}-assignment to the corresponding Max-c-Lin-2 instance with weight at least W/2 + k/2.
The proof of the opposite direction is straightforward.

In the above construction, Max-c-CSP can be reduced to Max-c-Lin-2 in $O(2^c \cdot m)$ time (where m is
the number of constraints) and the number m' of equations in the transformed instance will be $O(2^c \cdot m)$ in
the worst case. □
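The construction in this proof can be traced on a small instance. The sketch below is an illustration under assumed encodings (constraint functions as Python callables on 0/1 tuples, variables as integer indices, equations as a dict from variable sets to a right-hand side and a weight); none of these names come from the paper. It checks that $2^c$ times the excess over AVG equals the satisfied weight minus the falsified weight in the resulting Max-c-Lin-2 instance.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import combinations, product

def fourier_expand(f, scope):
    """Coefficients alpha_T of the multilinear polynomial that agrees with f,
    where a 0/1 input a_j is encoded as x_j = 1 - 2 * a_j."""
    s = len(scope)
    coeffs = {}
    for size in range(s + 1):
        for T in combinations(range(s), size):
            acc = 0
            for a in product((0, 1), repeat=s):
                sign = 1
                for j in T:
                    sign *= 1 - 2 * a[j]
                acc += f(a) * sign
            if acc:
                coeffs[tuple(sorted(scope[j] for j in T))] = Fraction(acc, 2 ** s)
    return coeffs

def to_lin2(constraints, c):
    """constraints: list of (f, scope, weight) with len(scope) <= c.
    Returns ({T: (rhs, weight)}, AVG)."""
    total = defaultdict(Fraction)
    for f, scope, w in constraints:
        for T, alpha in fourier_expand(f, scope).items():
            total[T] += w * alpha
    avg = total.pop((), Fraction(0))  # the constant terms add up to AVG
    equations = {}
    for T, coef in total.items():
        if coef > 0:
            equations[T] = (0, int(2 ** c * coef))
        elif coef < 0:
            equations[T] = (1, int(-2 ** c * coef))
    return equations, avg

def csp_weight(constraints, a):
    return sum(w * f(tuple(a[v] for v in scope)) for f, scope, w in constraints)

def lin2_excess(equations, a):
    """Weight of satisfied equations minus weight of falsified equations."""
    return sum(w if sum(a[v] for v in T) % 2 == rhs else -w
               for T, (rhs, w) in equations.items())

f_or = lambda a: int(any(a))
f_and = lambda a: int(all(a))
instance = [(f_or, (0, 1, 2), 2), (f_and, (1, 3), 1)]
equations, avg = to_lin2(instance, c=3)
print(avg, equations)
```

Each constraint of arity at most c contributes at most $2^c$ monomials, which is where the $O(2^c \cdot m)$ bound on the number of equations comes from.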